Proceedings of the Fifth Dutch - Belgian Information Retrieval
نویسندگان
چکیده
Todays content is increasingly a mixture of text, multimedia, and metadata. One way to format this mixed content is according to the adopted W3C standard for information repositories, the so-called eXtensible Markup Language (XML). The increasing use of XML in scientific data repositories, Digital Libraries and on the Web, has brought about an explosion in the development of XML tools, and in particular systems to store and access XML content. Whereas many of todays access tools still treat documents as single large (text) blocks, XML offers the opportunity to exploit the internal structure of documents in order to allow for more precise access thus providing more specific answers. Providing effective access to XML-based content is therefore a key issue. Providing effective access to XML-based content is what XML retrieval research is about. XML retrieval systems aim to exploit the logical structure of documents, which is explicitly represented by the XML markup, to retrieve document components (the so-called XML elements) instead of whole documents in response to a user’s query. Implementing this more focused retrieval paradigm means that an XML retrieval system needs not only to find relevant information in the XML documents, but also determine the appropriate level of component granularity to return to the user. It is this goal that makes the development AND the evaluation of XML retrieval approaches challenging. INEX, the Evaluation Initiative for XML Retrieval, is an initiative currently supported by DELOS; a network of excellence in digital libraries (http://www.delos.info/). INEX was set up at the beginning of 2002 with the aim to establish infrastructures, XML test-suites and appropriate measurements for evaluating XML retrieval approaches. INEX is responsible for a range of evaluation activities in the field of XML information access. INEX 2004 had five tasks, where each task was designed to test particular aspects of XML retrieval: ad-hoc retrieval, interactivity, natural language querying, relevance feedback, and heterogeneous content. A large number of institutions (The following institutions contributed to the organization of INEX 2004: U Duisburg-Essen (D), QMUL (UK), U Amsterdam (NL), LIP6 (F), U Otago (NZ), CWI (NL), QUT (AUS), IBM (IL), LIS (DK), U Minnesota-Duluth (US)) are strongly involved in the organization of these activities, by contributing to the evaluation methodologies and providing evaluation software and tools. After three years of INEX, we are still needing further research in order to arrive at the correct methodology for evaluating XML retrieval, in particular with respect to returning the appropriate level of component granularity. This talk will discuss the issues involved in returning the appropriate level of component granularity to the user in the context of INEX, and before INEX, as encountered in some of my previous work. My aim here is to both present the evaluation methodology followed by INEX, and to obtain feedback from the attendees as the issues we are dealing with concern many other areas. For more details about INEX 2004, see http://inex.is.informatik.uni-duisburg.de:2004/.
منابع مشابه
Proceedings of the 7 th Dutch - Belgian Information Retrieval
This talk will present challenges involved in building a Web search engine and will touch on questions of system and algorithm design, in particular as they involve large scale data processing and data mining.
متن کاملReport on the 3rd Dutch-Belgian Information Retrieval Workshop (DIR-2002)
In the Low Countries, interest in information retrieval, the discipline that is mainly concerned with identifying information in document or multimedia collections, has been modest but steady throughout the years. In 2000, this led to the first Dutch-Belgian Information Retrieval Workshop (DIR) at the University of Maastricht (the Netherlands). Two years later, the third edition of DIR shows th...
متن کاملDecisions on embryo disposition in cross-border reproductive care: differences between Belgian and Dutch patients at a Belgian fertility center
Empirical research into cross-border reproductive care is scarce and many facets of the phenomenon are unexplored. The objective of this study was to compare Belgian and Dutch patients regarding the way they perceived the treatment they received and regarding the embryo disposition decisions (EDDs) they made. A questionnaire was sent to patients for whom embryos were cryopreserved at the Ghent ...
متن کاملBelgian Dutch versus Netherlandic Dutch: New patterns of divergence? On pronouns of address and diminutives
The linguistic climate in northern Belgium (Flanders) has been changing in recent years. A new corpus of spoken Dutch meets the need for data reflecting actual and present-day language use in this part of the Dutch language area. The ‘Spoken Dutch Corpus’ allows us to uncover and analyse the present state of colloquial Belgian Dutch and the changes which mark this condition. This paper discusse...
متن کاملAn integration of external information for foreign stallions into the Belgian genetic evaluation for jumping horses.
The aim of this study was to test the integration of external information, i.e. foreign estimated breeding values (EBV) and the associated reliabilities (REL), for stallions into the Belgian genetic evaluation for jumping horses. The Belgian model is a bivariate repeatability Best Linear Unbiased Prediction animal model only based on Belgian performances, while Belgian breeders import horses fr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005